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Abstract 

In this paper we consider the state complexity of an operation on formal languages, root(L). 
This naturally entails the discussion of the monoid of transformations of a finite set. We obtain 
good upper and lower bounds on the state complexity of root(L) over alphabets of all sizes. 

1 Introduction 

A deterministic finite automaton, or DFA, is a 5-tuple A = (Q,T,,5,qo,F), where Q is a finite 
non-empty set of states, £ is the finite input alphabet, 5 : Q x S — > Q is the transition function, 
qo £ Q is the initial state, and F C Q is the set of final states. We assume that 5 is defined on all 
elements of its domain. The domain of 5 can be extended in the obvious way to Q x S*, where £* 
is the free monoid over the alphabet E. For a DFA A, the set L{A) = {w £ S* : 5(qo,w) £ F} is 
said to be the language recognized by A. 

The state complexity of a regular language LCE*, denoted sc(L), is defined as the size (the number 
of states) of the smallest DFA recognizing L. The state complexity of various operations on regular 
languages, such as union, concatenation, and Kleene closure, has been studied extensively; see, for 
example, fTHIroj . 

In this paper we examine a less familiar operation, namely root(L), which is given by 

root(L) = {w £ S* : 3n > 1 such that w n G L}. 

Note that this operation is not the same as the ROOT(L) operation studied by Horvath, Leupold, 
and Lischke [7j- The study of the root(L) operation requires us to examine the connections between 
finite automata and algebra. 



For a finite set Q, a function / : Q — > Q is called a transformation. The set of all transformations of 
Q is denoted Q®. For transformations f,g £ Q®, their composition is written fg, and is given by 
{f9){ a ) = Qifil)): f° r an q £ Q- Together, the set and the composition operator form a monoid. 

Transformations and their monoids have been studied in some detail by Denes (whose work is 
summarized in and Salomaa [111 I12j. Denes investigates several algebraic and combinatorial 
properties of transformations, while much of Salomaa's work is concerned with subsets that gener- 
ate the full monoid of transformations. 



Let L be a language and A = (Q,T,, 5, qo, F) a DFA such that L = L{A). For w G E*, define 
$w (q) = 3(q,w), for all q £ Q. Then 8 W is a transformation of Q. If we denote the empty word by 
e, then 5 e is the identity transformation. 

Theorem 1.1. For a language L and a DFA A = (Q,T,,5,qo, F) with L = L(A), define the DFA 
A' = (Q Q , E, 5', q' Q , F') where q' Q = 5 t , F' = {/ : 3n > 1 such that f n (q ) G F}, and 5' is given by 

5'(f, a) = f5 a , for all f G Q Q and a G E. 

r/ien root(L) = L(^4'). 

Proof. An easy induction on w G E*, proves that 8'(q' ,w) = 5 W . Then 

x G root(L) 44> 3n > 1 : i tt Ei 
^5x(9o) G F' 

□ 

In addition to giving us a construction for a DFA recognizing root(L), the above result shows 
that this operation preserves regularity, that is, if L is regular, then root(L) is regular. Zhang 
Hi used a similar technique to characterize regularity-preserving operations. To recognize the 
image of a language under an operation, Zhang constructs a new automaton with states based on 
Boolean matrices. These matrices represent the transformations of states in the original automaton. 

The result of Theorem ll . II also allows us to give our first bound on the state complexity of root(L). 
Corollary 1.2. For regular language L, if sc(L) = n then sc(root(L)) < n n . 

Proof. This is immediate from the construction given in Theorem ll.il □ 

In the remainder of this paper we improve on this upper bound, and give a non-trivial lower bound 
for the worst-case blow-up of the state complexity of root(L), for alphabets of all sizes. These 
upper and lower bounds demonstrate that a simple, intuitive operation that preserves regularity 
can increase the state complexity of a language from n to nearly n n , even over binary alphabets. 

Our main results are given in Corollary VA.211 Corollary 1^.241 and Theorem 1^.271 

2 Unary languages 

In the case of unary regular languages, it turns out that the state complexity of the root of a 
language is bounded by the state complexity of the original language. 

Proposition 2.1. If L is a unary regular language, then sc(root(L)) < sc(L). This bound is tight. 

The idea of the following proof is that given a particular DFA recognizing L, we can modify it by 
adding states to the set of final states. The resulting DFA will recognize the language root(L). 
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Proof. Let £ = {a} be the alphabet of L. Since L is regular and unary, there exists a DFA A 
recognizing L, such that A = ({qo, ■ ■ ■ , q n -i}, {a}, 5, qo,F), where 

S(qi, a) = qi+x, for all < i < n — 1, 

and 

5(g n _i, a) = gj, for some < j < n — 1. 
We call the states qo, ■ ■ ■ , qj-i the tail, and the states qj, . . . , q n -\ the /oop. 

Notice that root(L) = {a s G S* : s | i,a G £}. For all strings a* G L, we have some k > and 
some 6 < n — 1 such that t = kl + b, where I = n — j is the number of states in the loop. Let 
s = Im + c for some m > and some < c < I. Then 



t ^ 


3r : 


Ik + b = r(lm + c) 




<^> 


3r : 


/A; — rim = rc — b 






3r : 


gcd(/, —Im) rc — b 


(by Theorem 4.3.1 of [T]) 




3r : 


I rc — b 






3r, v 


: rc — b = Iv 






3r, v 


: rc — Iv = b 






gcd(Z 


c) | b. 


(by Theorem 4.3.1 of 1 ) 



It follows that the set of divisors of the numbers of the form kl + b, k > 0, b < n — 1 is as follows: 

{Im + c G Z : m > 0, gcd(/, c) | 6}. 

These divisors can be recognized by changing the corresponding states into final states. Therefore, 
sc(root(L)) < sc(L). 

To show that this bound is tight, for n > 2 consider the language L n = {a n ~ 2 }. Under the Myhill- 
Nerode equivalence relation jJJ|, no two strings in the set {e, a, a 2 , . . . , o n_1 } are equivalent. All 
other strings in S* are equivalent to a n_1 . This gives sc(L n ) = n. Furthermore, since a n_2 is 
the longest word in root(L ra ), S(qo,a n ~ 2 ) cannot be a state in the loop. It follows that we require 
exactly n — 1 states in the tail plus a single, non- final state in the loop. Hence sc(root(L„)) = n. 
Therefore the bound is tight. □ 

3 Languages on larger alphabets 

For a regular language L C £*, if A is the minimal DFA such that L = L(A), then as we saw in 
Section ^ based on the set of all transformations of the states of A, we can construct an automaton 
A' as in Theorem ll.il to recognize root(L). Though this new DFA, A', has all transformations of 
Q as its states, it is easy to see that the only reachable states are those that are a composition of 
the transformations 5 ai , . . . , 5 am , where oi, . . . , a m G S. This set of elements, 6 ai , . . . , 5 am , and all 
of their compositions form the transformation monoid of A. We use this fact to improve on the 
upper bound of the state complexity of root(L). 



3 



Corollary 3.1. For a regular language L, let A be the smallest DFA recognizing L. Then if M is 
the transformation monoid of A, we have that sc(root(L)) < |M|. 

Proof. In Theorem ll.il the only reachable states in the construction of A' are those that belong to 



Define Z n = {1, 2, . . . , n}. Now define T n = Z^ n , the set of transformations of Z n , and S n C T n as 
the set of permutations of Z n . For 7 G T n we write 



Definition 3.2. If M C T n is the set of all compositions of the transformations fi, ■ ■ ■ , f m S T n , 
then we say that {fx, . . . , / m } generates M. 

Definition 3.3. For 7 G T n , define the image of 7 by img(7) = {y G Z n : y = 7(2;) for some z G 



Definition 3.4. For 7 £ T n , define the ran/; of 7 as the number of distinct elements in the image 
of 7, and denote it by rank(7). 

The relationship between the state complexity of a language and the transformation monoid natu- 
rally leads to the question of how large a submonoid of T n can be generated by m elements, where 
m is a positive integer. In connection with the study of Landau's function (for a survey see |§lllL)|). 
Szalay JB] showed that, for m = 1, the largest submonoid of T n has size 



In the case where m > 3, the results are well known. 

Lemma 3.5. For n > 3, suppose H CT„ such that H generates T n . Then \H\ > 3. Furthermore, 
\H\ = 3 if and only if H can be written as H = {a, (3, 7}, where {a, (3} generates S n and rank(7) = 
n — 1. 

For a proof of this lemma, see Denes [2J. This gives us the result that the largest submonoid 
generated by three elements has the full size n n . 

Contrary to the case for m = 1 and m > 3, it seems that only recently there has been any interest 
in determining the largest submonoid on two generators. Significant progress has been made in 
this area by Holzer and Konig [U[3]) an< ^' independently, by Krawetz, Lawrence, and Shallit |S]. 
The results of Holzer and Konig are summarized here. 

For coprime integers k, I > 2, where k + I = n, let a = (1 2 • • • k)(k + 1 k + 2 • • • n) be a 
permutation of Z n composed of two cycles, one of length k, the other of length I. Define U^i to be 
the set of all transformations 7 G T n where exactly one of the following is true: 

1. 7 = a m for some positive integer m; 



the transformation monoid of A. 



□ 




Z n }. 
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2. For some i £ {1, • • • , k} and some j £ {k + 1, ■ ■ ■ , n} we have that 7(1) = 7(&) and for some 
m € {k + 1, • • • , n} we have that m img(7). 



Let 7Ti = (1 2 • • • A;) be an element of and let 7r 2 S S^-l be a permutation such that m and 

7T2 generate S n -l- Now define (3 € T n by 



Lemma 3.6 (Holzer and Konig). T/ie se£ Uf-i is a submonoid ofT n and is generated by {a, /?}. 

It is worth noting that in their definition of U^i, Holzer and Konig allow k = 1 and 1 = 1. They 
show implicitly, however, that the size of the monoid in these degenerate cases is too small to be 
of any consequence here. 

Theorem 3.7 (Holzer and Konig). Forn > 7 , there exist coprime integers k,l such that n = k+l 
and 



In addition to a lower bound on the size of the largest two-generated monoid, Theorem 13. 71 gives us 
the existence of a sequence of two-generated monoids whose size approaches n n as n tends toward 
infinity. Similar results were obtained independently by Krawetz, Lawrence, and Shallit |S]. 

More recently, Holzer and Konig 5| proved the following result regarding the maximality of monoids 
of the form Uk,i- 

Theorem 3.8 (Holzer and Konig). For all prime numbers n > 7, there exist coprime integers 
k,l > 2 such that k + 1 = n and U^i is the largest two-generated submonoid of T n . 

They also stated the following conjecture. 

Conjecture 3.9 (Holzer and Konig). For any n>7, there exist coprime integers k,l > 2 such 
that k + 1 = n and U^i is the largest two-generated submonoid ofT n . 

Since the connection between the state complexity of root(L) and the transformation monoid of 
L has been established in Corollary 13.11 we can take advantage of the results of Theorem 13.71 and 
Theorem 13.81 if we can construct a language based on a monoid. By associating an alphabet with 
the generators of a monoid, we can define a transition function for a DFA. The definition of the 
DFA is then completed by choosing a start state and a set of final states. This construction is given 
more formally below. 

Let n,m be integers with n,m > 1. For a set of transformations X = {a±, . . . ,a m }, let M C T n 
denote the monoid generated by X. Then a DFA based on X is a DFA M = (Z n , S, 5, zq,F), where 
|E| > m, zq £ Z n , F C Z n , and 5 is given by 




n — 1 
7r 2 (n - 1) 





S a = *(a), for all a£S, 



for some map f :S-*IU {8 e } that is surjective on X. 
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Proposition 3.10. Let Ai = (Z n , E, 5, zq, F) be a DFA. Then M is the transformation monoid of 
A4 if and only if Ai is based X , for some ICT„ that generates the monoid M . 

Proof. For a DFA Ai based on X, the fact that M is the transformation monoid of Ai is immediate 
from the construction. For any DFA Ai that has M as its transformation monoid, we have that 
the set {5 a G T n : a G £} generates M. Then we can simply take ^ given by ^(a) = 5 a , for all 
a G S. □ 

In particular, let Ay t x = (-Zn> E, 5, zo,F) denote the DFA based on X when zq = 1, F = {1}, and 
is bijective on an m-element subset of X, with all other elements of £ mapped to 5 e . If and 
^2 are maps over the same domain, then A$, lt x is isomorphic to A^ 2j x, up to a renaming of the 
states and alphabet symbols. For this reason, we will often denote this DFA simply by At, x- 



Example 3.11. Let Y = {a,/3}, where 

A 2 3 4 5\ , a A 2 3 4 5\ 
a= \2 1 4 5 3 J' and/?= V2 3 4 1 2j- 

Define $ by $(a) = a and $(6) = (3. Then Figure 0321 depicts the DFA Aj,,y- 




For n > 5, define X n C T n to be a subset of transformations, and let M n denote the monoid 
generated by X n . In particular, define Xki = {a, /?}, where a and (5 are as in Lemma \'A. fil Then X^i 
generates Uk : i- We now state our main result concerning the state complexity of root(L(^4s,x n ))- 

Theorem 3.13. IfU^j C M n , for some coprime integers k > 2 and I > 3, with k + l = n, then the 
minimal DFA recognizing root(L(_4.£,x„)) has \M n \ — states. 

Before we are ready to prove this theorem, we must state a few more definitions and lemmas. 

Definition 3.14. Let p G T n . For any i,j,k, if p(i) = k = p(j) implies that i = j, then we say 
that k is unique. 

Definition 3.15. Let p G T n have rank 2, with img(p) = {i,j}. Then by the complement of p, we 
mean the transformation p G T n , where 

m= U, M*)=j; 
[j, if p(k) = i. 
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. (1 2 3 ■•• n-1 n\ . _ (1 2 3 ■■■ n-1 n 

For example, if p = ^ 3 2 ... 2 2 J , then p = 2 g g g 

It is easy to see that, in general, p and ~p have the same rank, and ~p~ = p. 

For an automaton M = (Z n ,H,5,zo,F), define the DFA M* = (M n ,H,S',S e ,F f ), where 5 e is the 
identity element of T n , S'(r], a) = n5 a for all 77 G M n , for all a G X, and F' = {n £ T n : n(zo) = zq}. 
Then L(M*) = root(L(M)). 

For 77, 6* G M n , with n ^ 0, note that 77 and 6* are equivalent states if and only if for all p G X* we 
have that 

<$'(?/, p) G F , <*6'(9,p) G F'. 

However, since 5'(rj,p) = n5 p , this is equivalent to saying that 77 and 6> are equivalent states in M 
if and only if for all p G M n , we have 

7?p G F' ^ 8p G F'. 

Lemma 3.16. Let 7CT„ generate M n , and let M. = (Z n ,T,,S, zo, F) be an automaton based on 
Y such that Zo G F. Let n, G M n , with 77 / and rank(7/) = 2. If t](zq) is unique in the image 
of n, and rj = 0, then 77 and are equivalent states in M* . 



Proof. We have 



and 



1 2 • • • zo — 1 zo ^0 + 1 • • • n 
j j ■■■ j i j ■■■ j 



1 2 • • • Zq — 1 Zq Zq + 1 ■ ■ ■ 77, 

i i • • • i j i i 



for some i / j. 

If 77(^0) G F, then 77 G F'. If 0(z o ) G F, then G F'. Otherwise, 9(z ) G Z n \{z }. Since 
0{z) = 77(^0) G F for all z G Z„\{zo}, we have that 2 (zq) G F, and hence G F' . Similarly, 
9{zq) G F implies that 77, 9 G F'. Furthermore, if img(77) n F = 0, then img(7/ n ) n F = 0, for all n, 
and hence ?7 n (2:o) G" F so that 7/ G" F'. Since img(77) = img(0), this gives 9 G" F'. Therefore 77 G F' 
if and only if G F' . 

Let p G M n . Since 77 and 9 have rank 2, we must have that np and 9p have rank < 2. If 77/? has 
rank 2, then p{i) / p(,?'), so that rfp = 9 p. Hence np G F' if and only if 6p G F'. The argument is 
the same for the case where Op has rank 2. Now, if np has rank 1, then we must have p = pVp", 
where cr(s) = cr(i) for some s and i, and where p' is a permutation such that either np'{zo) = s 
and np'(z) = t, for all z 7^ zo> or np'(zo) = t and np'(z) = s, for all 2 7^ zo- Without loss of 
generality, assume the former. Then clearly Op'(zo) = t and Op'(z) = s, for all z / zq. It follows 
that np' a = Op' a, so that np = Op. 

Therefore 77 and are equivalent states. □ 
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Lemma 3.17. Let Y C T n generate M n , and let A4 = (Z n , E, S, Zq, F) be an automaton based 
on Y, such that zq F. Let n, 9 £ M n> with 77(20) unique in the image of n, rank(r/) = 2, and 
img(?7) = img(0). If 9(zq) = 77(20), and if 9(z) = 7/(20) implies that z £ F, then 77 and 9 are 
equivalent states in A4* . 

Proof. If 77(2:0) G -F\ then 77 £ F' . Since #(20) = r l{ z o)i it follows that 9 £ F' . Now suppose that 
77(20) £ F. If 77 G F', then since rank^) = 2, we must have that 77 2 (zo) £ F. Now #(20) ^ 
so 9(9{zq)) 7^ 77(2:0). But since rank(#) = 2, and img(0) = img(7/), it follows that 6> 2 (2o) = t] 2 {zq). 
Hence 9 G -F'. If 77 i* 1 ', then 77(2:0) = 20 or img(77) n F = 0. In either case, this implies that 9 F' . 
Therefore n £ F' if and only if 9 £ F' . 

Let p £ M n . Then, following an argument similar to the one used in the proof of Lemma 13.161 we 
have that np £ F' if and only if 9p £ F' . Therefore 77 and 9 are equivalent states. □ 

Now that we have a characterization of equivalent states in the general case for M* , we turn our 
attention toward the specific case, for x . 

Lemma 3.18. Let 77, 9 £ M n , with 77 7^ 9. If ranker/) = 1, then 77 and 9 are not equivalent states 
in A% Xn . 

Proof. Since 77 has rank 1, we have that img(77) = {21} for some z\. If 77(1) 7^ 9(1), then take 
p £ Ukj such that p(z\) = 2, and p(z) = 1, for all z 7^ z\. Then img^/?) = {2}, so that np ^ F'. 
But 9p(l) = 1, so that 9p £ F' . Hence 77 and 9 are not equivalent. If 7/(1) = 9(1), then rank(#) 7^ 1 
so that for some 22 / 1 we have 9(z 2 ) 7^ z\. Take p £ L 7 ^ such that p(9(z 2 )) = 1, and p(z) = z 2 , 
for all 2 7^ 9(z 2 ). Then img(??p) = {z 2 }, so that np F' . But (9p) 2 (l) = 1, so that 9p £ F' . Hence 
77 and 9 are not equivalent. □ 

Lemma 3.19. For 77, 9 £ M n , with 77 7^ 9, let 77 have rank 2. Then 77 and 9 are equivalent states 
in _4.£ x if and only if n(l) is unique in the image of n, and rj = 9. 

Proof. Let img(77) = {i, j} for some Without loss of generality, assume that 77(1) = i. 

Since 1 £ F, Lemma 13.161 applies and gives the result in the forward direction. For the other 
direction, we have two cases. 

Case 1. i is not unique in the image of 77. 

Case l.a. i = 77(1) / 9(1). 

Since i is not unique, n(z) = i, for some 2 7^ 1. Choose p £ Uk,i such that 
p(i) = 2, and p(9(l)) = 1. Then np(l) = np(z) = 2 so that (np) n (l) = 2, for 
all n > 0. This gives us np F' . But 9p(l) = 1, so that 9p £ F' . 

Case l.b. i = n(l) = 9(1). 

Case l.b.i. img(77) = img(0). 

Since img(?7) = img(#) and 77 7^ 9, then for some 27^ 1, we must 
have either % = 77(2) 7^ 9(z) = j, or j = 77(2) 7^ 9(z) = i. Without 
loss of generality, assume the former. Then choose p £ Uk i such 
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that p(i) = z, and p(j) = 1. Then (rjp) n (l) = z, for all n > 0, and 
(9p) 2 (l) = 1. This gives us r]p G" F' and 6>p G F' . 

Case l.b.ii. img(7/) 7^ img(0). 

If rank(0) = 1 then by Lemma 13.181 7/ and 9 are not equivalent. 
Otherwise there exists some z\ G Z n such that 0(zi) G" img(ry). Take 
p G Ufe i such that p(0(,8i)) = 1, and p{z) = 2, for all z / 0(zi). Then 
r/p 7^ 0p and rank(r/p) = 1, and so by Lemma l3.181 rjp and 0/5 are not 
equivalent. Hence 77 and 6 are not equivalent. 

Case 2. z is unique in the image of 77, and r\ 7^ 8. 

If rank(0) = 1 then by Lemma 13.181 77 and 9 are not equivalent. If img(^) = img(0), 
then since 77 7^ 9, we have that 0(1) is not unique. So we can reverse the roles of 77 and 
9 and apply case 1 to get the desired result. Assume then, that img(?7) 7^ img(0), and 
rank(0) > 2. Then the fact that 77 and 9 are not equivalent follows just as in case l.b.ii. 

Therefore, 77, and 9 are equivalent states if and only if 77(1) is unique in the image of 77, and 
77 = 9. □ 

Lemma 3.20. Let 77, 9 G M n , Twzt/i 7/ 7^ 0, iff], 9 have rank > 3, then 77 and 9 are not equivalent 
states in x . 

Proof. Since 77 7^ 9, there exists some z\ G Z n such that 77(2:1) 7^ 0(zi). Let Z2 = i](zi). Take 
p G C/fc^ such that p(z2) = 1, and p(z) = 2, for all z ^ zi- Since rank(7/) > 3, we have rank(rjp) = 2. 
If 77/3(1) is not unique, then by Lemma 13.191 rjp and 9p are not equivalent. Hence 77 and 9 are not 
equivalent. If 77/3(1) is unique, then it must be that Z\ = 1. Furthermore, since rank(0) > 3, we 
cannot have 9{z) = z<i for all z 7^ 1, so we cannot have 0/3(2) = 1 for all z 7^ 1. Therefore 0/3 7^ ?7p. 
Then by Lemma 13.191 r/p and 9p are not equivalent. Hence 77 and 9 are not equivalent. □ 

We are now ready to prove Theorem 13.131 

Proof (Theorem \3.13\) . Lemma 13. 181 - HT.2UI cover all possible cases for 77, 9 G M n , 77 7^ 9. Therefore, 
two states are equivalent if and only if they satisfy the hypothesis of Lemma f3.16l There are Q) such 
equivalence classes in U^i Q M n , each containing exactly 2 elements. All other elements of M n are 
in equivalence classes by themselves. It follows that the minimal DFA recognizing root(L(„4.£.x n )) 
has \M n \ - (™) states. □ 

Now that we have established a close relationship between sc(root(L)) and the transformation 
monoid of the the minimal automaton recognizing L, we can take advantage of results concerning 
the size of the largest monoids to give bounds on the worst-case blow-up of the state complexity of 
root(L). The following corollary gives a lower bound for alphabets of size two. It also proves the 
existence of a sequence of regular binary languages with state complexity n whose root has a state 
complexity that approaches n n as n increases without bound. 

We now state our first main result. 
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Corollary 3.21. For n > 7, there exists a regular language L over an alphabet of size 2, with 
sc(L) < n, such that 

sc(root(L)) >n n \l-V2 (^j ' e^ - V8_L e T2 

Proof. The result follows from a combination of Theorem 13.71 and Theorem 13.131 □ 

Our results from Theorem 13.131 do not apply when 1 = 2. Unfortunately, Theorem 13.81 does not 
exclude this possibility. To guarantee that this fact is of no consequence, we must show that not 
only is the monoid f7 n -2,2 never the largest, but that it is at least smaller than the largest 
monoid. The following lemma deals with this. 

Lemma 3.22. For n > 7, we have that 

\U2,n-2\ — \Un-2,2\ > 

Due to space constraints, the proof of this lemma has been relegated to the appendix. 

The choice of start and final states in the construction of the DFA A.t,,x„ is the best possible. The 
following theorem will show that for any other DFA with the same transition function, a different as- 
signment of start and final states will not increase the state complexity of the language it recognizes. 





Theorem 3.23. Let Y C T n generate M n , and let M = (Z n , E, 5, zq, G) be an automaton based on 
Y. Then sc(root(L(.M))) < sc(root(L(^ s>Xn )). 

Proof. If zq E G, then Lemma 13 . 161 applies . It follows that there are at least pairs of equivalent 
states in M*. If zq G, then Lemma 18. 171 applies . and again we have at least Q) pairs of equivalent 
states in M*. In either case, this gives 

sc(root(L(7W))) < \M n \ - Q < sc(root(L(^ SjX J). 

□ 

We now state our second main result. 

Corollary 3.24. For prime numbers n > 7, there exist positive, coprime integers k > 2, I > 3, 
with k + I = n, such that if L is a language over an alphabet of size 2, with sc(L) < n, then 
sc(root(L)) < |Z7 fc)I | - (£). Furth ermore, this bound is tight. 

Proof. Let U 1 denote the largest two-generated submonoid of T n . Then by Theorem l3.8l and Lemma 
13.221 we have that U' = Uyy for some coprime integers k' > 2, I' > 3 with k! + V = n. 

Let A4 be the smallest DFA recognizing L, and let M be the transformation monoid of Ai. If 
M is of the form U^i, with k > 2, I > 3, then \Uk,i\ < W\- It follows from Theorem 13.231 that 
sc(root(L)) < \U ki i\ - (™) < \U'\- (J). If M is of the form U ktU with k = n - 2, I = 2, then by 
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Corollary 13. II and Lemma T3. 221 we have sc(root(L)) < \U n -2,2\ < |f 2,71—2! ~~ (2) — l^'l ~~ (2) • 

Let V denote the largest two-generated submonoid of T n that is not of the form Uk,i for some 
coprime integers k > 2, I > 3 with k + 1 = n. Then for all integers n > 81, a simple observation of 
Holzer and Konig's proof of Theorem 13.81 shows that \U'\ — \ V\ > (^)- For all integers 7 < n < 81, 
the fact that \U'\ — \V\ > (q) has been verified computationally. It follows from Corollary 13. II that 
if M is not of the form Uk i, we have 

sc(root(L)) < |M| < \V\ < \U h ,i 

The fact that the bound is tight is an immediate consequence of Theorem 13.131 □ 

If Conjecture 13.91 is true, then for all n > 7, where n is not prime, the construction of -4.£,x fc , 
yields a language that is within Q) of the maximum blow-up. We conjecture that this construction 
achieves the maximum. 

Conjecture 3.25. For and integer n > 7, there exist positive, coprime integers k > 2, I > 3, 
with k + I = n, such that if L is a language over an alphabet of size 2, with sc(L) < n, then 
sc(root(L)) < \Uk : i\ — (2)- This bound is tight. 

The results concerning the largest monoid on > 3 generators are definite and much simpler. For 
this reason, on alphabets of size > 3 we are able to give a much better bound. 

Lemma 3.26. For n > 1, if M C T n is a monoid such that \M\ > n n — Q); then M — T n . 
Proof. For 1 < n < 3, the result can easily be verified computationally, so assume that n > 4. 

There are Q) transpositions in T n . Since |M| > |T n | — (2), it follows that M contains at least one 
transposition. There are (n — 1)! permutations of Z n that have one cycle of length n. Since n > 4, 
we have that (n - 1)! > (™). A gain, considering the size of M, it follows that M contains at least 
one permutation that is a full n-cycle. It follows that S n C M. 

Furthermore, there are (2) ■ n\ transformations of Z n that have rank n — 1, so it follows that M 
contains at least one transformation of rank n — 1. Then by Lemma l3,5[ we have that M = T n . □ 

We now state our third main result. 

Theorem 3.27. Let E be an alphabet of size m > 3. For n > 1, if L is a language over S with 
sc(L) < n, then sc(root(L)) < n n — (g). Furthermore, this bound is tight. 

Proof. Define M to be the transformation monoid of the smallest DFA recognizing L. If |M| < 
n n — (2), then certainly sc(root(L)) < n n — Qj. So suppose that |M| > n n — (2) . Then it follows 
from Lemma 13 . 261 that M = T n . 

For 1 < n < 6, it has been verified computationally that if the transformation monoid of the 
minimal DFA recognizing L is T n , then sc(root(L)) = n n — Q)- F° r n > 7, if the transformation 
monoid is T n , then clearly U^i C T n for some suitable k, I so that Theorem 13. 131 applies, and hence 
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sc(root(L)) = n n - (™). 

To show that the bound is tight, it suffices to show that for any n there exists a language L over 
E such that the transformation monoid of the minimal DFA recognizing L is T n . Let X be a set of 
transformations such that \X\ = min(n, 3) and X generates T n . For n £ {1,2}, the fact that such 
an X exists is easy to check. For n > 3, the existence of X follows from Lemma 13.51 Then the 
language L(Ay: ) x), gives the desired result. □ 
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Appendix: Omitted Proofs 



Proof (Lemma VS \2ty . As stated in [S], for k + I = n, we have the following formula 



r I I z — r 



where {"} is a Stirling number of the second kind, the number of ways to partition a set of n 
elements into i non-empty sets. This gives 



. .i — k J \i — 1/ J \\i \ I t J 1 i — r 

1=1 xx 7 x / / \ k j r=l v y v 



Since {^} = whenever k > n or A; < 1, for k = 2, and / = n — 2, we have 

i 

E 



r=l 

/n-2\ / 2 



(21 




fn-2] 




fn-2] 




fn-2] 




H 




hi 




hi 


h-ij 


UJ 




[ i — r J 




h-2j 



Also, notice that (™_ 2 ) — G-n+2) ^ s positive when 2 > i > n — 1, and zero otherwise, so that (x 
becomes 



And finally, using the identity {£} = {£_}} + fc{ n fc }, we see that 

nl fn — 2l , . fn — 2] . , fn — 2 

+ (2i-i) A + a-i) 



v i I 1 i — 2 I I i — 1 I I « 

so that we get 

i^- 2 i - i^- 2 , 2 i > - 2 ){ri 2 } + {i - i} { n 7 2 })* ! - - (2 



□ 
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